Data Editing and Imputation in Business Surveys Using “R”
نویسنده
چکیده
Purpose – Missing data are a recurring problem that can cause bias or lead to ineffi cient analyses. The objective of this paper is a direct comparison between the two statistical software features R and SPSS, in order to take full advantage of the existing automated methods for data editing process and imputation in business surveys (with a proper design of consistency rules) as a partial alternative to the manual editing
منابع مشابه
Use of R in Business Surveys at the Italian National Institute of Statistics: Experiences and Perspectives
Over the last six years, R has been steadily gaining ground in Istat, since a strategic decision to limit dependence on proprietary technologies (like SAS) was taken. A migration activity of our critical IT tools from SAS to R was carried out (we can cite MAUSS-R for optimal sample allocation, and ReGenesees for the calculation of estimates and sampling errors), and new R packages were develope...
متن کاملMultivariate Outlier Detection and Treatment in Business Surveys
Multivariate outlier detection based on the Mahalanobis distance with the BACON-EEM algorithm, the TRC algorithm and the ER algorithm is presented and imputation of outliers and further missing values is discussed. The methods are illustrated with a data set on Swedish municipalities. The relation between outliers, influential observations and selective editing is explored. Finally robust multi...
متن کاملImputation of parent-offspring trios and their effect on accuracy of genomic prediction using Bayesian method
The objective of this study was to evaluate the imputation accuracy of parent-offspring trios under different scenarios. By using simulated datasets, the performance Bayesian LASSO in genomic prediction was also examined. The genome consisted of 5 chromosomes and each chromosome was set as 1 Morgan length. The number of SNPs per chromosome was 10000. One hundred QTLs were randomly distributed a...
متن کاملMicrodata Imputations and Macrodata Implications: Evidence from the Ifo Business Survey
A widespread method for nowand forecasting economic macro level parameters such as GDP growth rates are survey-based indicators which contain early information in contrast to official data. But surveys are commonly affected by nonresponding units which can produce biases if these missing values can not be regarded as missing at random. As many papers examined the effect of nonresponse in indivi...
متن کاملاهمیت خویشاوندی ژنتیکی و رکورد فنوتیپی بر صحت ژنومی دادههای جانهی شبیه سازی شده با استفاده از مدل های حیوانی در حضور اثرات متقابل ژنوتیپ و محیط
The objective of this study was to investigate the role of genetic relationships between training and validation set with considering different ratio of phenotypic records of training set on accuracy of genomic prediction via animal models containing genotype × environment interactions in simulated imputation data. For this purpose, four different scenarios using 15k density containing differen...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014